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A deterministic finite-state automaton (FSA) is an abstract se- 
quential machine that reads the symbols comprising an input 
word one at a time. An FSA is symmetric if its output is in- 
dependent of the order in which the input symbols are read, i.e., 
if the output is invariant under permutations of the input. We 
show how to convert a symmetric FSA A into an automaton- 
like divide-and-conquer process whose intermediate results are 
no larger than the size of ,4's memory. In comparison, a similar 
result for general FS A's has been long known via functional com- 
position, but entails an exponential increase in memory size. The 
new result has applications to parallel processing and symmetric 
FSA networks. 
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1 INTRODUCTION 

One of the simplest models of computation is the deterministic finite state au- 
tomaton (FSA). Although FSAs are often considered to act as solitary com- 
puting devices (e.g., in the classical string matching algorithm of Knuth, Mor- 
ris, and Pratt [6]) they can also be connected together to form a computing 
network (e.g., in cellular automata and the models of 0~|[8]). 

A symmetric automaton is one that will produce the same output even 
if its inputs are permuted. Symmetric FSAs are natural building blocks for 
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fault-tolerant computation networks. In previous work with Vempala [10'| we 
showed there are symmetric FSAs implementing fault-tolerant randomized 
algorithms for the following tasks: implicit 2-approximate census (via an 
algorithm due to Milgram |8|), network search (via breadth-first search and 
greedy traversal), basic connectivity problems (finding all bridges), and leader 
election. 

Precisely, the finite-state symmetric graph automaton (FSSGA) model in- 
troduced in [ 10 1 is that a copy of the same symmetric FSA is placed at every 
node of a graph (network); when a node "activates" to advance its state, it ob- 
tains one input symbol from each neighbour without regard to order. In sum, 
FSSGA are like symmetric cellular automata but generalized in the sense that 
the underlying graph does not have to be regular. Three models of symmetric 
automata are given in ifTUI : in the sequential model each node is a (sequential) 
FSA, in the parallel model each node uses divide-and-conquer on its inputs 
(in a way that will be defined precisely later), and in the mod-thresh model 
each node applies a finite-size formula (analogous to a regular expression) to 
update its state. One of the main results of [ 1 1 is that these three models 
are equivalent; e.g., for any symmetric FSA there exists a divide-and-conquer 
process to compute the same function. Unfortunately, for the particular con- 
struction given in iflOl . an exponential increase in the size of the state space 
is required. 

More generally, an efficient way to simulate any FSA with divide-and- 
conquer has been known for decades. The basic technique is sometimes 
called functional composition as applied to parallel prefix. Ladner and Fis- 
cher used the technique in 1977 (7) on the PRAM model of parallel comput- 
ing; see also [9| for an implementation in mesh networks. The basic idea is 
that for any single character a, the transition of the FSA on that character can 
be viewed as function f a from the FSA's state space back to itself, and the 
computation of the FSA on a string w = W1W2 • • • Wk is essentially deter- 
mined by the composition of functions f w := f Wk o ■ ■ ■ o f W2 o f Wl . In turn, 
this composition problem lends itself easily to divide-and-conquer: break the 
string into two parts w = uv, compute the compositions f u and f v for the 
two parts, and return /„ o /„. Like the transformation of [ 10 1 for symmetric 
automata, the size of intermediate results increases exponentially, since for a 
state space Q there are |Q|'^' functions from Q to Q. 

The main contribution of this paper is that for a symmetric FSA, no in- 
crease in the state space size is necessary. We present the result (Theorem 
|7]l after introducing our notation. The resulting small-state-space divide-and- 
conquer process is applicable to the PRAM setting, so e.g. for symmetric 
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FSAs we are able to decrease the working memory used by the divide-and- 
conquer approaches of 019]. For high-degree FSSGAs and the special case 
of symmetric cellular automata, divide-and-conquer is a natural way for each 
node to read its neighbours' states, as we will illustrate in Section |2 our 
main result permits such divide-and-conquer processes to be more memory- 
efficient. 

2 PRELIMINARIES 

We denote an FSA by the tuple (E, Q, qo, {/crjo-es, O, (3). Here E is a finite 
set called the input alphabet, Q is a finite set called the state space, qo is an 
element of Q called the initial state, each f„ is a function from Q to Q called 
the transition function of a, O is a finite output set, and (3 is an output function 
from Q to O. 

Definition 1 (FSA). An FSA is any tuple A = (E, Q, q Q , {f a } ae ^,0, /3) of 
the form described above. 

Let E* denote the set of all strings over E, and let f o g denote the func- 
tional composition of / and g, defined by (/ o g)(x) = f(g(x)). It is con- 
venient to extend the definition of / to strings via functional composition. 
Namely, for a string w — w\W2 ■ ■ • Wk, define 

JW fw k ° JWk-i o • • • o f W2 o f Wl , 

and by convention, where A denotes the empty string, let f\ be the identity 
function on Q. In particular, we obtain the identity f uv (q) = f v (fu{l)) f° r 
any strings u, v 6 E* and any q 6 Q. Let E + denote the set of nonempty 
strings over E; the empty string is excluded to agree with the divide-and- 
conquer model later on. Our definition of f w affords a concise description of 
computation for an FSA. 

Definition 2 (FSA computation). An FSA A = (Y,,Q,q Q ,{f a } creT:7 0,l3) 
computes the function vj^ : E + — ^ O defined by 

v A (w) := (3(f w (q )). 

Note that the traditional model where the FSA accepts or rejects strings 
depending on the final state can be modeled by setting O = {accept, reject} 
and defining f3(q) — accept iff q is an accepting state. We use the multi- 
output version because it is more natural in some settings, e.g., the FSSGA 
model. 
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We represent a divide-and-conquer automaton by a tuple (£, Q, a, c, O, f3). 
As before E is the input alphabet, Q is the state space, O is the output set and 
(3 is the output function. Here a is an input function from £ to Q and c is 
a combining function from Q x Q to Q. Informally, the divide-and-conquer 
automaton runs according to the following rules: 

1 . apply a to all input characters 

2. combine states arbitrarily using c until only one state q* is left 

3. output f3(q*). 

Our definition will require that the end result of the computation is the same 
no matter how the arbitrary choices of combination are made. 

To give our formal definition, we use a set-valued function \ tnat maps 
each nonempty string to a subset of Q so that q* e x( w ) iff, dividing inputs 
arbitrarily, the input w could produce q* as the final state. We denote the 
length of w by |io|. 

Definition 3 (DCA). Let A' denote the tuple (£, Q, a, c, O, /3) as described 
above. Define XA'( W ) f or w £ ^ + recursively as follows: if\w\ = 1, say w 
consists of the character a, then \A' ( w ) '■= { a (< J )}: otherwise (for \w\ > 2) 

XA'{w) := (J {c(q*,q*) \ q* u £ XA'{u),q* v G XA'{v)} (1) 

(u.v)-.uv— w 

where (u, v) ranges overall partitions of w into two nonempty substrings. We 
say that A' is a divide-and-conquer automaton (DCA) if for all w G S + , 

\{P(q*)\q*&XA'(w)}\ = l- (2) 

The previous definition amounts to saying that the output of a divide-and- 
conquer automaton should be well-defined regardless of how the dividing is 
performed. For a singleton set S let the. member^) be a function that returns 
the element of S, i.e., it "unwraps" the set. 

Definition 4 (DCA computation). A DCA A' = (£, Q, a, c, O, (3) computes 
the function va : £+ — > O defined by 

v A '{w) = the.member({/3(g) | q G XA'{w)}). (3) 

Figure 1 illustrates how a node in an FSA-based computing network could 
make use of the divide-and-conquer methodology. Specifically, when reading 
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the states of all neighbours the node can process and combine inputs from its 
neighbours in parallel rather than one-by-one. As a function of the neigh- 
bourhood size |r| (i.e. the degree) the circuit depicted has depth 0(log |T|) 
and hence this approach would lead to efficient physical implementation for 
large neighbourhoods. 




FIGURE 1 

An FS A in a network updates its state via divide-and-conquer. The node v is activating 
and its neighbours are labeled n. The lines carry values from tail to head, and the boxes 
apply functions, like in a circuit diagram. Each neighbour supplies an input symbol 
and the divide-and-conquer process produces an output symbol which is used by v to 
update its state. 



We denote by the set of all functions from Q to Q. We mentioned the 
following well-known (e.g., (7|) result earlier: 

Theorem 5. Given any FSA A there is a DCA A' such that vj± — vjj , i.e., 
A and A' compute the same function. 

Proof. Define A' = (£, Q« a ^ f a , (A, f 2 ) ^ / 2 o f u 0, 0). □ 
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Conversely, as was observed in iflOl . any divide-and-conquer automaton 
can be easily rewritten in sequential form since a sequential FSA can be 
thought of as conquering one input at a time. 

The particular result we want to prove pertains only to symmetric au- 
tomata, which we now define formally. 

Definition 6. Suppose that A is an FSA or a DCA. We say that A is symmetric 

if for every w G S + and every permutation w' ofw, v A (w) — v A {w'). 

The main result of the present paper is the following, which is a more 
efficient version of Theorem|5]for symmetric FSA's. 

Theorem 7. Given any symmetric FSA A = (£, Q, qo, /, O, ft), there is a 
DCA A' = (£, Q', a, c, O, ft') such that v A = v A , and \Q'\ < \Q\. 

In the next section, we prove a supporting lemma for later use. In Section 
[4] we complete the proof of Theorem|7] In Section|5]we mention some ideas 
for future investigation. 

3 LOOKING INSIDE A SYMMETRIC FSA 

The key to Theorem[7]is to focus on automata with specific irredundany prop- 
erties. Symmetry of an automaton is a black-box property — the definition 
only cares about the correspondence of final outputs when the inputs are per- 
mutations of one another, regardless of the internal structure of the automaton. 
We now describe how this black-box property (symmetry), when combined 
with irredundancy requirements, implies a structural property — namely, that 
the transition functions must commute. 

Definition 8 ([4 |). Let A = (£, Q, q , /, 0, ft) be an FSA and let q £ Q. The 
state q is said to be accessible if for some string w € £*, f w (qo) — <Z- We say 
A is accessible if every state in Q is accessible. 

Definition 9 (H). Let A = (£, Q, q , f, O, ft) be an FSA and let q, q' e Q. 
The states q and q' are said to be distinguishable if for some string w G £*, 
ft(fw(q)) 7^ ft{fw(q'))- We say A is distinguishable if every pair of states in 
Q is distinguishable. 

As we will later make precise, every FSA can be rewritten in an accessible, 
distinguishable way. This gives some general applicability to the following 
lemma. 
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Lemma 10 (Commutativity Lemma). Let A = (S, Q, qo, /, O, (3) be a sym- 
metric FSA that is accessible and distinguishable. Then the functions } CT gE 
commute. 

We defer the proof of the lemma to the end of this section. In order to 
explain how it is useful, we recall the following additional definitions. 

Definition 11 (H). Two automata A, A' are equivalent if they compute the 
same function, i.e. if va = V A'- An FSA A is minimal if for every FSA A' 
equivalent to A A' has at least as many states as A 

It is not hard to see that any minimal FSA must be accessible (or else 
we could remove some states) and distinguishable (or else we could collapse 
some statesfl- It is also not hard to see that for every FSA A there exists a 
minimal equivalent FSA A'; such minimization can be performed algorith- 
mically in poly(\Q\, |S|) time, e.g. using an approach of Hopcroft J5). In 
sum, for any FSA we can efficiently obtain an equivalent FSA meeting the 
conditions of Lemma [TUl which we now prove. 

Proof of Lemma\TU\ Suppose for the sake of contradiction that not all of the 
functions / commute. Then / CT1 (/ CT2 (g)) ^ f^ifadq)) for some ai,a 2 £ 
T,,q £ Q. We want to show that this discrepancy can be "continued" to a 
violation of symmetry. Let q\ denote / CT2 (q)) and q 2 denote f ai (f a2 (q)). 

First, since q is accessible, there exists some string wi such that f wi (go) = 
q. Second, since qi and q 2 are distinguishable, there exists some string w r 
such that (3(f Wr (qi)) ^ fi{fw r {q2))- Now putting things together we have 

PUwivi<T 2 wMo)) = Kf<n<r2w r (q)) = PifwMi))- 

Similarly 

/3(jW<7 2CTl ™ r (3o)) = P{fwA<li)) ^ PifwMi))- 

Hence A outputs different values under the inputs wiO\Oiw r and u^o^ci w r ; 
since these inputs are permutations of one another, this means A is not sym- 
metric. □ 

* Interestingly, the converse is also true: any accessible, distinguishable FSA is minimal. See 
| 4 | for a derivation of this result as a corollary of the Myhill-Nerode theorem; adapting the proof 
from accept/reject automata to our more general model is straightforward. 
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4 PROOF OF THEOREM 7 



We are given that A = (S, Q, qo, f, 0, 0) is a symmetric FSA and without 
loss of generality it is minimal. For each q 6 Q, define r[q] 6 S* to be a 
fixed representative string that generates state q from go, i-e., such that 

/r[g](?o) = 9 

holds. Each r[g] is guaranteed to exist since q is accessible. These r[q] remain 
fixed for the remainder of the proof. 

We need the following claim, which roughly says that every string w is 
interchangeable with the representative string r[f w (qo)]. We know they are 
interchangeable when they are read first, but using the commutativity of the 
f's, we can show they are interchangeable when read later. 

Claim 12. For each w E S* we have f w = f r [f w (q )]- 

Proof. For any q £ Q, alternately applying the definition of r[-] and the com- 
mutativity of the f's, we have 

fw(q) = fw(fr[q](qo)) = fr[q](fw(qo)) 

= fr[q\{fr[f m ( qo )\{qo)) = fr[f w (q )](fr[q]{Qo)) = fr[f w (q )] ((?)■ □ 

4.1 The Construction 

Here we define the divide-and-conquer automaton A' = (E, Q', a, c, O, /?'). 
Namely, let Q' = Q, f3' = j3, define a (a) := fa(qo) and define c(q, g') := 
f r [q'](q). It remains to prove that the construction is correct, i.e., that 1/4 = 
^/ . Our recursive proof uses the idea outlined previously, that each string w 
is essentially interchangeable with r[f w (qo)]. 

Claim 13. For any nonempty string w € E + , the set XA'(w) is a singleton 
andth.e.membeT(xA'(w)) = fw(qo)- 
Proof. We proceed by induction on \w\. 

Base case: If w has length 1, say it consists of the character a, then f w (qo) = 
/o-(<7o)> an d by the definition of %, we have XA'( W ) — {^C ")} = 
{fcr(qo)}- Thus the claim is satisfied. 

Inductive step: Now w has length 2 or more. The induction statement to be 
proved is XA'{w) — {fw{lo)}- Recalling Equation ((T), which defines 
X in this case, this is equivalent to saying that 

for all partitions w = uv of w into two nonempty substrings, 
c(the.member(x^'(w)),the.member(x^'(w))) = f w (qa)- (4) 
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By the induction hypothesis, the left-hand side of is equal to 

c(/t,(<R>), /,(*>))• (5) 

Applying the definition of c, we find that the value © is in turn equal 
to / r [/„ (g )] (/«(?()))• Finally, applying Claim[12]we see that the value 
© is equal to f v (f u (qo)) = fw(qo), as desired. □ 

Proof of Theorem^ As outlined previously, minimizing A makes it accessi- 
ble and distinguishable, without changing va- Now consider the DC A A' as 
defined previously. On any input w 6 E + , using Claim[T3l 

v A ,(w) = /3(the.member(x^'(w))) = (3(f w (q Q )) = va{w). 

Hence A and A' compute the same function (i.e., they are equivalent). 

Since the state space of A' is Q, and since Q could only have gotten 
smaller when A was minimized, the state space of the DCA A' is indeed 
no larger than the state space of the original FSA. □ 

One might question whether any result similar to Theorem[7]is possible if 
we discard the symmetry requirement. The following result gives a negative 
answer to this question and shows that the exponential state space increase of 
Theorem[5]is best possible. 

Proposition 14. For any integer n > 1, there is an n-state FSA A on a 
three-symbol alphabet T, so that any DCA equivalent to A has at least n n 
states. 

Proof. Let Q be a set of n states and £ a set of size 3. Denes [2| showed 
that Q®, viewed as a semigroup under the operation of composition, has a 
generating set of size 3. We choose {f a }aeT, to be this generating set; this 
implies that for every function g : Q — > Q, there is a string w[g] e S* so that 
fw[g] — 9- We define O = Q, /3 to be the identity function, and we choose 
go G Q arbitrarily; this completes the definition of the FSA A. 

Suppose for the sake of contradiction that there exists a DCA A' that 
computes 2/4, and that this DCA's state space Q 1 has \Q'\ < n n . By the 
pigeonhole principle there are two distinct functions 51,(72 G Q so that 
XA' ( w [9i}) H XA' (Mff2]) 7^ 0, since each \A' (•) is a nonempty subset of Q' . 
Let q £ Q denote a state for which gi (q) ^ 92 (<?) and let q 1 S Q' denote any 
element of ^=1,2 XA'{w[g t ]). 

Now let h : Q — > Q be any function for which h(qo) = q. We claim that 
the two input strings u>[/i]u;[(7i] for i = 1, 2 produce different outputs under 
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A and the same output under A', providing the contradiction. To see that the 
outputs under A are different, observe that 

v A {w[h]w[g l ]) = P{f w [h]w[g t ](Qo)) = P(gi(h(q ))) = g % {q) 

and since gi(q) ^ <?2(<7), we are done. To see that the outputs under A' are 
the same, let q' denote any element of \A' ( w [h]) an d notice that c(q' , q') G 
XA' [ w [h]w[gi]) for i — 1,2; then recalling Equations (0 and ([3J, we see that 

V A' k w ^A w \9^\) — V A' (M^-] w [#2]) as claimed. □ 

5 EXTENSIONS 

We mention in this sections some extensions of FSAs and ask if analogues 
of Theorem [7] hold for them. Some of these issues were raised previously in 

m. 

First, the main result of this paper is not suitable in the following natural 
situation. Suppose the input alphabet and state space are both the set of all 
fc-bit binary strings, i.e. £ = Q = {0, l} k , and that the transition function 
f a (q) is some polynomial-time Turing-computable function of a and q (and 
similarly for P). For such an FSA, Va{w) can be computed in \w\ ■ poly(k) 
time. If A is symmetric we can simulate it by a DCA using Theorem|7]but this 
approach takes exponential time in k, since minimizing A requires looking at 
all of its 2 k states. Functional composition (Theorem[5]l has the same issue. 
Thus, the open problem is to determine if a poly(k)-time technique exists to 
convert a symmetric FSA of this type into a DCA. 

Second, a variant of the above model might allow the string lengths to grow 
as some function k{m) of the total number of inputs m. Since the original 
submission of this paper and independently of our work, Feldman et al. J5) 
showed that for this sort of model, an analogue of Theorem Q holds where 
the divide-and-conquer version uses strings of length at most k 2 (m). Their 
construction, like ours, takes exponential time in k(m). 

Finally, the functional composition view of FSAs (e.g., in the proof of The- 
orem[5]l also works for nondeterministic automata and probabilistic automata. 
A result obtained by Feldman et al. [3 1 shows that an analogue of Theorem|7] 
for probabilistic automata is false, while the nondeterministic version appears 
to be an open problem. 
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